An Ontology-based Mapping Repository for Dynamic and Customized Data Integration
نویسندگان
چکیده
There exist an increasing number of (semi-)structured information sources over the Web. Data integration techniques are often applied to increase the access efficiency of heterogeneous information sources. Data integration systems are built by combining a set of data sources for meeting specific user requirements. Due to various user needs, a growing trend of increasing variety of data integration system is noticeable even for the same application. In the academic community, much effort has been devoted to building a single data integration system. However, the existing approaches are impractical when the scale of data integration systems becomes large. This article proposes a key component of a total solution for holistically constructing a large number of customized data integration systems with the assistance of a considerable number of cooperative humans (a.k.a, a community). This component is an ontology-based mapping repository, called M-Ontology. Since each data integration system requires mappings for resolving semantic interoperability among information sources, MOntology aims at the efficient storage, management and discovery of mappings. M-Ontology is shared by all the data integration systems in the same application domain for mapping sharing and reuse. The internals of the mapping model are semantics-based for easy understanding and management by (even non-technical) community members. Following this model, a “humanfriendly” mapping insertion algorithm is proposed for construction of a well-organized mapping ontology through incremental insertion of domain-specific mapping instances. Three types of human intervention strategies (i.e., validation, avoidance and prevention) are designed for improving the performance of construction algorithms. In addition, this article presents a mapping discovery algorithm to find the many-to-many complex mappings by leveraging the mapping ontology. The discovered mappings are recycled to further enrich the mapping ontology using the insertion algorithm. Finally, we conduct two sets of experiments on real-world data integration over the Web, and the results show that the approach is feasible and effective.
منابع مشابه
A community based approach to managing ontology alignments
The Semantic Web is rapidly becoming a defacto distributed repository for semantically represented data, thus leveraging on the added on value of the network effect. Various ontology mapping techniques and tools have been devised to facilitate the bridging and integration of distributed data repositories. Nevertheless, ontology mapping can benefit from human supervision to increase accuracy of ...
متن کاملO Ontology-Based Data Access and Integration
An ontology-based data integration (OBDI) system is an information management system consisting of three components: an ontology, a set of data sources, and the mapping between the two. The ontology is a conceptual, formal description of the domain of interest to a given organization (or a community of users), expressed in terms of relevant concepts, attributes of concepts, relationships betwee...
متن کاملDemo: A community based approach for managing ontology alignments
The Semantic Web is rapidly becoming a defacto distributed repository for semantically represented data, thus leveraging on the added on value of the network effect. Various ontology mapping techniques and tools have been devised to facilitate the bridging and integration of distributed data repositories. Nevertheless, ontology mapping can benefit from human supervision to increase accuracy of ...
متن کاملAn Approach to Conceptual Ontology Integration with an Ontology Repository and a Rule Base
There exist a lot of ontologies that together can enrich knowledge within a domain or cross several related domains; and can provide advanced services based on these integrated ontologies. However, the concepts are often described differently in various ontologies, which create problems for the ontology integration. In this paper, an approach of an ontology repository and a rule base is propose...
متن کاملData Definition Ontology for clinical data integration and querying
This paper describes an approach to build a Data Definition Ontology (DDO) in the context of full domain ontology integration with datasets in order to share and query clinical heterogeneous data repositories. We have adapted an existing semantic web tool (D2RQ) to implement a process that automatically generates the DDO from a database information model, thanks to reverse engineering and schem...
متن کامل